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(57) Abstract 

This invention provides nucleic acid engineered to produce peptides related to human mannose binding protein. 
These peptides are used in diagnosis and treatment of diseases. 
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HUMAN MANNOSE BINDING PROTEIN 

Background of -the Invention 

5 This invention relates to proteins able to 

bind mannose. 

Mannose-binding proteins (MBPs) have been 
isolated from rabbit, rat, and human liver. 
Taylor et al. Clinical Science 70:539, 1986. MBPs 

10 have also been found in serum, and may play a role 
in the disposal of pathogenic organisms. Id. 

Summerfield et al. Bioc, Biop > Acta 883 : 197, 
1986 describe two types of MBP's in human serum. 
These were detected using antibodies raised 

15 against a 30 kDa subunit of one MBP . The authors 
suggest that MBPs may bind noxious glycoproteins 
in the circulation prior to the removal of these 
glycoproteins; and that yeasts and bacteria 
contain glycoproteins which and are bound by MBPs. 

20 Stahl et al. Biol. Cell 51:215, 1984 

describe a mannose receptor, which is distinct 
from MBPs. These two proteins appear to be 
structurally related since antibodies to one 
protein may react with the other protein. 

25 Wild et al. Biochem. J\ 210 :167, 1983 

describe the isolation of MBP from human and rat 
liver. The human MBP has a molecular weight 
greater than one million and consists of 28 kDa 
and 30.5 kDa subunits. 
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Drickamer et al. Biol . Chem. 261 :6878, 
1986 describe the isolation of MBPs from rat 
liver, and the cloning of cDNAs encoding these 
proteins. Each MBP has a cysteine rich region, a 
5 collagen-like domain and a carbohydrate binding 
domain* 

Summary of Invention 

In one aspect, the invention features 

10 engineered nucleic acid encoding for at least 
about 20 contiguous amino acids of human 
mannose-binding protein, or having at least about 
60 or 90 bases able to hybridize under hybridizing 
conditions to nucleic acid encoding human 

15 mannose-binding protein. By engineered nucleic 
acid is meant nucleic acid removed from its 
natural environment by recombinant DNA 
methodology, or synthetic nucleic acid, or cDNA. 
This nucleic acid may be a fragment of DNA or RNA, 

20 it may be present in a vector system, 'or it may be 
within the genome of an organism. 

The other aspects, the invention features 
vectors, and expression vectors or cells 
containing these vectors, each vector having the 

25 engineered nucleic acid, and the invention 

features peptides expressed from these vectors or 
cells. By peptide is meant a chain of two or more 
amino acids, including proteins and polypeptides. 
These peptides, and antibodies to these peptides, 

30 may be used as therapeutic or diagnostic agents. 

In preferred embodiments, the nucleic acid 
encodes for a peptide having a greater than 75% 
homology to a fragment of at least thirty amino 
acids of human mannose-binding protein; most 
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preferably the nucleic acid encodes for human 
mannose -binding protein. In other preferred 
embodiments the nucleic acid is cDNA; the 
hybridizing conditions are at 42 °C in 5 x SSC, 
5 with washing at 68°C in 0.1 x SSC; and the nucleic 
acid encodes a carbohydrate binding region. Most 
preferably the region has at least about 60 bases 
from region 309-714, shown in Fig. 1; the nucleic 
acid is ligated to nucleic acid encoding the toxic 

10 part of a toxin molecule, most preferably the 
toxin molecule is chosen from AZT, ricin, or 
cholera toxin; the cell is a virus, bacterium, 
fungus, or eucaryotic cell; the virus is vaccinia, 
the bacterium is Escherichia coli, the fungus is 

15 yeast, and the eucaryotic cell is a cultered cell 
line. 

In a related aspect, the invention features 
a fragment of at least about 60 contiguous bases 
of the nucleic acid encoding human mannose binding 
20 protein deposited in the ATCC as strain number 
67483 . 

In another aspect, the invention features a 
method for treating animals infected with a 
bacterium, fungus, or virus. The method entails 

25 providing a peptide able to bind the mannose units 
on these organisms . The peptide is able to cause 
host defensive cells to be attracted to the 
organisms. The method further entails 
administering the peptide to the animal. 

30 In preferred embodiments, the peptide is a 

fragment of human mannose binding protein able to 
bind a carbohydrate; this peptide is able to 
disable the bacterium, fungus, or virus, and is a 
peptide as described above. Most preferably, the 
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animal is human; the infection results in a 
bacteremia or local bacterial infection, parasitic 
infection, or fungal colonization, and the route 
of administration is either intravenous, 
5 intramuscular, oral, or local, i.e., in the form 
of a powder, or lotion; or the virus is HIV or a 
related virus, and the peptide lowers the rate of 
infection of eucaryotic cells by the virus; the 
protein or peptide is the mannose binding protein 

10 provided at 1-500 pg/ml final concentration in 
human serum or tissue. 

In another aspect, the invention features a 
method for diagnosing patients susceptible to 
invention by viruses, bacteria, parasites or 

15 fungi, the method features detecting the serum 
level of mannose-binding proteins in an animal, 
wherein this level reflects the susceptibility of 
the animal to an infection. 

Preferably, the method features detecting 

20 reaction of an antibody to the above peptides with 
the serum, most preferably the detecting comprises 
an EL ISA test. 

Other features and advantages of the 
invention will be apparent from the following 

25 description of the preferred embodiments thereof, 
and from the claims. 

Description of the Preferred Embodiments 



30 



The drawings will first briefly be described. 
Drawings 
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Fig. 1 is a restriction endonuclease map of 
the MBP-human cDNA insert in pMBP. 

Fig. 2 is a representation of the cDNA 
sequence and corresponding amino acid sequence of 
5 MBP human. 

Fig. 3 is a representation of the genomic 
DNA, and corresponding amino acid sequences in all 
three reading frames of human MBP. 

Fig. 4 is a diagram of a proposed model of 

10 MBP. 

Fig. 5 is a comparison of the amino acid 
sequences of MBP-human with other lectins; 
invariant regions are shown on the top line, and 
galactose and mannose-specif ic regions on the 
15 lower lines. 



Human Mannose -binding protein (MBP-human) 

MBP-human is a soluble lectin-like molecule 
which is synthesized in hepatocytes and released 

20 into the bloodstream. Generally, MBP-human is 
able to bind carbohydrates, such as mannose, at 
its carbohydrate binding domain. MBP-human can be 
isolated generally as described by Wild et al., 
supra, and Drickamer et al. supra, for example, by 

25 passage down a mannose-sepharose column. 

The general structure of MBP-human is shown 
in Fig. 4. The amino-terminal end 10 is cysteine rich, 
consistent with multimer formation by interchain 
disulfide bridges. Next to this is a collagen-like 

30 segment 12 having a repeated pattern of Gly-X-Y 
(Gly represents glycine; X and Y are other amino 
acids), similar to those of non-filbrillar 
collagen genes. Finally, there is a 
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carboxy-terminal carbohydrate recognition domain 
14. The mannose-binding domain is within the 
region . 

Nucleic acid, for example, DNA, encoding 
5 MBP-human can be isolated by standard techniques. 
For example, oligonucleotide probes specific for 
the nucleic acid may be constructed and used to 
probe either genomic or cDNA libraries, as 
described by Drickamer et al., supra . 

10 Altenatively, gene fragments from related genes 
can be used as probes. Preferably, the probe is 
homologous to a region of the carbohydrate binding 
domain of MBP-human. The clones isolated by this 
technique contain engineered nucleic acid. Once 

15 isolated, the gene encoding MBP-human is useful 
for producing recombinant MBP-human protein, or 
peptide fragments thereof. In addition, the 
nucleic acid can be modified by standard 
techniques in order to express modified peptides . 

20 Examples of cloning MBP -Human encoding 

nucleic acid are given below. These examples are 
not limiting to the invention and one skilled in 
the art will recognize that there are many 
equivalent means for accomplishing similar 

25 results . 

Example 1: cDNA clones 
A human liver cDNA library was constructed 
in pKT218 by standard technique as described by 
30 Woods et al. 79 Proc. Natl. Acad. Sci. USA. 5661 , 
1982. This library was probed using a gel 
purified radiolabeled rat MBP-C cDNA sequence 
digested with Xho l and EcoRI as described by 
Drickamer et al., supra . This probe was used 
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under non- stringent, conditions to identify- 
potentially useful clones . The filters were 
prehybridized for 1 hour at 42 °C in 0,75 M NaCl, 
50mM sodium phosphate, pH7.4, 5mM EDTA, 5x 
5 Dehardts solution and 0.1% SDS (5 x SSC), and then 
hybridized overnight at 42 °C. The filters were 
washed at 45°C in 2xSSC for 30 minutes and then in 
lxSSC for 30 minuteN n addition a AHEPG2 
gtlO cDNA library plated in E. coli C600 was 

10 screened, as described by Kwiatkowski et .al . 
323 Nature 455, 1986 . 

Five clones, including pMBP, were isolated 
and their sequences determined by the method of 
Sanger et al. (74 Proc. Natl. Acad, Sci. USA 5463, 

15 1977) using M13 , Mpl8 cloning vectors (Messing et 
al. Proc. Nat. Acad. Sci. USA 74:3642, 1977). 
This sequence is shown in Fig. 2. The restriction 
map of pMBP is shown in Fig. 1, it has a 3.6kb 
EcoR I insert isolated from the above \ gt 10 

20 library. 

Example 2: Genomic clone 
The 650 bp carboxy terminal Pst-1 fragment 
(Fig. 1) of a MBP-human cDNA clone was used as a 
probe for human genomic library. This library was 

25 contracted by standard techniques in EMBL 3 A by 

inserting Mbol-digested genomic DNA into the BamH I 
site. Clones which hybridized under stringent 
conditions were isolated. Specifically, the 
hybridization was performed as described above, 

30 except the wash conditions were at 68°C in O.lxSSC. 
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The positively identified clones were plague 
purified and their nucleic acid sequence 
. determined as above. This sequence is presented 
in Fig. 3. 

Other related genes can be isolated by this 
procedure. For example, the membrane receptor 
protein of macrophages is similar to MBP -human in 
"that its DNA hybridizes under less stringent 
conditions (using the above hybridization buffer 
at 37 °C) to MBP -human probes, and a peptide of 
similar size to MBP-human is immunoprecipitated 
with antisera to MBP-human. 

Expression of MBP-human peptide fragments is 
by standard procedure. For example, the desired 
region of the MBP-human encoding DNA, preferably 
the cDNA, can be isolated from one of the 
above-described clones and inserted into any one 
of several standard expression vectors. A 
preferred region for expression is that encoding 
the carbohydrate binding domain, most preferably 
the mannose binding domain. This region is 
between nucleotide bases 359-807 in Fig. 1, 
including a 350bp Pst l- Xba l fragment. To identify 
the desired region more specifically the sequence 
is compared to that in related proteins such as 
human mannose receptor. Comparison to rat A and C 
MBPs reveals most homology between other mannose 
binding proteins at the region equivalent to the 
collagen region, at nucleotide bases 287-359 (Fig. 
1). This region is not useful in the invention 
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sincs it: is not involved with mannose binding. 
Rather, the region from 359-807 in Fig. 1 is most 
useful . 

In order to show that any particular region 
5 of MBF-human does bind mannose the cDNA encoding 
it can be engineered by standard procedures to 
produce clones containing just this region. The 
resulting cloned DNA is then inserted into an 
expression vector. The peptide produced by such a 

10 vector is then passed through a mannose-sepharose 
column to see whether it will bind to mannose. 
Alternatively, a radioimmunoassay can be performed 
to see if radiolabelled mannose will react with 
the expressed peptide. Those peptides which bind 

15 mannose are useful in this invention. 

It is unlikely that a single short linear 
region of amino acids of the MBP -human peptide is 
involved in binding to mannose, rather two or more 
such regions will probably cooperate to form a 

20 three-dimensional peptide configuration which can 
interact with, and bind 7 mannose. Such regions 
can be identified by comparison to other mannose- 
binding proteins as described above, and the DNA 
fragment encoding all such regions cloned and 

25 expressed. Such a DNA fragment is likely to be at 
least 60-90 base pairs in length, encoding at 
least about 20-30 amino acids. 

Referring to Fig. 5, such a comparison was 
performed by comparing other lectins, with mannose 

30 or other sugar binding specif i ties, to MBP-human. 
The lower line of the figure shows a concensus for 
mannose binding proteins, the amino acids on this 
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line and in the upper line (showning invariant, 
amino acids ) are the most important for binding to 
mannose. These results were obtained by 
comparison of MBP-human to lectin proteins 
including the human and rat hepatic 
asialoglycoprotein receptors (Drickamer, 1987, 
Structure and biosynthesis of membrane receptors 
which mediate endocytosis of glycproteins , Kidney 
International, in press), the avian heptic 
receptor (Drickamer, 1987 supra ), the apoprotein 
of dog (Benson et al., Proc . Natl . Acad . Sci. USA 
82:6379, 1985) and human surfactant (White et al., 
Nature 317 :361, 1985); the NH 2 portion of a 
glalctose specific lectin isolated from the 
hemolymph of ^ periginia (Takahashi et al. , 
J- Biol. Chem. 260 :12228, 1985); a lectin isolated 
from the coelomic fluid of a sea urchin 
A^ crassispina (Giga et al . , J. Biol . Chem . 13: 
6197, 1987); a chicken cartilage core proteoglycan 
protein (Shigaku et al., Proc . Natl . Acad . Sci. 
USA 83:5081, 1986) and the IgE Fc receptor (Ikuta 
et al., Proc . Natl . Acad . Sci . USA 84:819, 1987). 

The above described mannose binding peptide, 
or the entire recombinant protein, is useful for 
specifically targeting cells expressing mannose on 
their surface , e.g., bacteri a , fungi , and 
viruses. Thus, by linking this peptide to 
molecules able to kill or inhibit growth of such 
cells a hybrid peptide of great therapeutic use 
can be .constructed. For example, the toxic part 
°f ricin and cholera toxin, or chemicals such as 
AZT can be linked to this peptide. In order to do 



» — r . 



WO 89/01519 _ PCT/US88/02591 



-11- 



this, the nucleic acid encoding such toxins can be 
ligated to the mannose binding pepti de-encoding 
nucleic acid and expressed as a single entity to 
form a hybrid peptide, for example, as described 
5 by Murphy U.S. Patent 4,675,382, hereby 

incorporated by reference. Alternatively, the two 
peptides can be synthesized separately and linked 
chemically, for example, as described by Ross U.S. 
Patent 4,275,000, hereby incorporated by 

10 reference. 

Expression vectors suitable for peptide 
expression include all standard bacterial (e.g., 
pKK233-2, Amann et al . Gene in press, sold by 
Pharmacia, 800 Centennial Avenue, Piscataway, NJ 

15 08854), yeast, and viral expression vectors, as 

well as eucaryotic vectors. Those skilled in the 
art will realize that such vectors generally are 
suitable for expressing the protein and the 
exSfciple below is not limiting to this invention. 

20 The full length or partial cDNA MBP clone, 

with or without toxin pepti de-encoding nucleic 
acid, can be ligated into the vector pSV2neo 
(Southern et al . J. Mol . Appl . Genet. 1:327 (1982) 
and Cloning Vectors, A Laboratory manual Ed. 

2 5 Pouwels et al . Elsevier Science Pub. NY, 52 
Vanderbilt Avenue, NY, NY 10017, 1985) which 
contains an origin of replication from pBR322 and 
an ampicillin resistance gene. It also contains 
SV40 sequences to provide a transcriptional 

30 promoter and a polyadenylation sequence. The DNA 
is inserted between these two sequences. After 
ligation the recombinant vector is propagated in 
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Escherichia coli and -then introduced into Chinese 
hamster ovary cells using a standard calcium 
phosphate trans fecti on protocol. The neo gene on 
this vector provides resistance to G418, which can 
5 be used to select for transformed cells . 

Expression of mannose binding peptides by 
these vectors and organisms can be followed using 
a sepharose-mannose column* Expressed material is 
bound to the column, eluted with 50mM Tris/lOmM EDTA, 

10 and run in 8% polyacryl amide gels (using Laemmli 
buffers, Nature 227 :600 , 1970) to observe the 
presence of peptides. Those clones which produce 
mannose-binding peptides, i.e., peptides which 
bind to such a column, are suitable in this 

15 invention. 

Antibodies to such expressed peptides or to 
MBP-human itself can be produced by standard 
techniques. They may be monoclonal or polyclonal 
and are useful for identification of the peptides 

20 within animal serum or in clinical diagnostic 
tests . 
Use 

Exposed mannose is a feature of the cell 
walls of many pathogens, whereas higher organisms, 

25 including humans and animals, tend to have 

processed membrane glycoproteins having complex 
sugars which mask internal mannose residues. 
These internal mannose residues are not recognized 
by MBPs. Recombinant mannose binding protein, or 

30 chimeric peptides containing the mannose binding 
domain, are useful therapeutic agents. These 
proteins or peptides specifically bind 
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mannose-rich pathogens, including bacteria, fungi, 
yeasts, parasites, or the envelope glycoproteins 
of certain viruses, and thus direct removal of 
such pathogens from the animal . 
5 For non-viral pathogens, efficacy of removal 

by host defense mechanisms may be increased by 
directing attachment of the mannose binding 
protein complex to the surface of phagocyte cells, 
thereby enhancing the clearance of the pathogens 

10 from the circulation, by causing the phagocytes to 
recognize these pathogens . 

For viruses, which express mannose-rich 
glycoproteins, direct inactivation of the virus 
and viral infected cells is enhanced by attaching 

15 toxins, such as ricin, cholera, or diptheria or 

antimetabolite drugs, such as AZT, to the mannose 
binding domain of the mannose binding protein. 

For example, the 350bp Pst l- Xba l fragment 
shown in Fig. 1, comprising the carboxy- terminal 

20 mannose binding domain of MBP-human can be 

expressed in an expression vector, and the peptide 
produced linked chemically to nucleotide analogues 
such as dideoxycytosine or AZT. As shown below, 
f luorescencely labelled such peptides do not bind 

25 to cells uninfected with HIV, the virus thought to 
cause Acquired Immunodeficiency Syndrome (AIDS), 
but do bind to infected cells. The resulting 
product should be particularly effective in 
specifically targeting drug-like molecules to HIV 

30 or HIV-infected cells. 

Example 3 : HIV targeting 
MBP-human was shown to be effective in vivo 
for preventing infection of H9 CD4 + cells with 
HIV. Purified HIV was incubated in the presence 
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or absence of highly purified homogenous MBP-human 
(prepared as described by Summerfield et al. r 
Biochimica et Biop . Acta 883:197, 1986; Wild et 
al. # Biochem. J. 210 :167, 1983; Townsend et al., • 
Biochem. 1M : 209, 1981; and Kawasaki et al., 
J. Biochem. 94:937, 1983). The treated virus was 
then incubated with H9 CD4 + lymphocytes (which are 
primary targets for HIV infection) and 7 days 
later viral infectivity was measured by a) the 
appearance of HIV envelope glycoprotein (which was 
assayed on the cell surface by immunofluorescence 
using specific anti-envelope glycoprotein 
antisera) and b) the presence of reverse 
transcriptase activity (which is present only when 
the cell is infected with HIV). MBP-human 
completely inhibited viral entry into cells. This 
was shown by the absence of HIV envelope 
glycoprotein on the cell surface, and by 
indectectable reverse transcriptase activity. 
Control experiments showed that the inhibition by 
MBP-human was* specific; these experiments involved 
completing MBP-human with mannose rich yeast 
maim an, and neo-glycoprotein mannose-BSA. 

In experiments using fluorescently-labelled 
MBP-human to observe binding to infected or 
uninfected cells, the fluorescently labelled 
MBP-human was used to show that mannan and 
mannose-BSA inhibits the binding of MBP-human to 
virally infected cells, and that MBP-human does 
not bind uninfected H9 cells. Thus MBP-human is 
recognizing exposed mannose units on these cells. 

Thus, MBP-human or the mannose binding 
domain thereof are suitable for identifying cells 
infected with HIV, or related viruses which 
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express mannose rich envelope glycoproteins on 
their cell surface. The MBP-human, the mannose 
binding domain or chimeric molecules thereof can 
be used to target cytotoxic agents to directly and 
5 specifically kill infected cells. Further, these 
molecules can be used to prevent the spread of 
viral infection, and even the initial infection 
itself. 

MBP-human and related peptides as described 

10 above may be administered by routine methods. For 
example, they can be injected directly into the 
blood stream of an animal, especially humans, to a 
level of between 1-SOOjjg/ml serum (most 
preferably, lSOpg/ml final concentration, and this 

15 dose repeated to maintain this level. They can be 
administered prophylaticaly or after infection. 
Similarly, the molecules may be administered 
orally, injected subcutaneous ly, or even applied 
in powder or lotion form, for example, to treat 

20 local infections, such as bacterial infection, or 
infection with Tr i chophyton rub rum , which causes 
athlete's foot. 

Another use of these peptides is in the 
determination of an animal 1 s susceptibility to 

25 infection by agents such as HIV. Here, the serum 
level of MBPS in the animal is measured using 
antibodies produced to MBP-human, or related 
peptides, in for example, an ELISA protocol. The 
level of MBPs in the serum can then be related to 

30 the susceptibility to infection of this animal to 
an agent, and this relationship used to estimate 
other animals 1 susceptibility. Thus, for example 
if a high level of MBP-human is linked to low 
•susceptibility to infection by HIV, then a human 
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having a low level of MBP-human is likely to be 
susceptible to HIV infection. Further, at the 
genomic level, such susceptibility may be related 
to defects in the nucleic acid. Such defects can 
5 be discovered using the cloned MBP-human genes, or 
fragments thereof, as probes. Polymorphisms 
linked to HIV susceptibility can be detected and 
used to predict susceptibility of other humans to 
infection. 

10 Deposits 

The following deposit was made on 
August 4, 1987 , with the American Type Culture 
Collection (ATCC), where the deposit was given the 
accession number ATCC 67483 . 

15 Applicants 1 assignee , - Children ' s Medical 

Center Corporation, represents that the ATCC is a 
depository affording permanence of the deposit and 
ready accessibility thereto by the public if a 
patent is granted- All restrictions on the 

20 availability to the public of the material so 
deposited will be irrevocably removed upon the 
granting of a patent. The material will be 
available during the pendency of the patent 
application to one determined by the Commissioner 

25 to be entitled thereto under 37 CFR 1.14 and 35 USC 
122. The deposited material will be maintained 
with all the care necessary to keep it viable and 
uncontaminated for a period of at least five years 
after the most recent request for the furnishing 

30 of a sample of the deposited microorganism, and in 
any case, for a period of at least thirty (30) 
years after the date of deposit or for the 
enforceable life of the patent, whichever period 
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is longer- Applicants' assignee acknowledges its 
duty to replace the deposit should the depository 
be unable to furnish a sample when requested due 
to the condition of the deposit. 
5 Other embodiments are within the following 

claims . 
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Claims 



1. Engineered nucleic acid encoding for at 
least about 20 contiguous amino acids of human 

5 mannose-binding protein, 

2. Engineered nucleic acid comprising at 
least about 60 bases able to hybridize under 
hybridizing conditions to nucleic acid encoding 
human mannose-binding protein. 

10 3 - The nucleic acid of claim 1 or 2 wherein 

said nucleic acid encodes a peptide having greater 
than 75% homology to a fragment of at least thirty 
amino acids of said human mannose-binding protein. 

4. The nucleic acid of claim 2 wherein said 
15 nucleic acid encodes said human mannose-binding 

protein. 

5. The nucleic acid of claim 4 wherein said 
nucleic acid is cDNA. 

6. The nucleic acid of claim 2 wherein said 
20 nucleic acid comprises at least about 90 bases 

able to hybridize under hybridizing conditions to 
human nucleic acid encoding mannose-binding 
protein* 

7. The nucleic acid of claim 2 wherein said 
25 hybridizing conditions comprise 42 °C in 5 x SSC, 

with washing at 68 °C in 0.1 x SSC, 

8. A nucleic acid fragment of at least 
about 60 contiguous bases of the nucleic acid 
encoding human mannose binding protein deposited 

30 in the ATCC as strain number 67483. 
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9. The nucleic acid of claim 6 wherein said 
nucleic acid encodes a carbohydrate binding region. 

10. The nucleic acid of claim 9 wherein said 
region comprises at. least about 60 bases from 

5 region 359-807 shown in Fig. 1. 

11. The nucleic acid of claim 10 wherein 
said nucleic acid is ligated to nucleic acid 
encoding the toxic part of a toxin molecule. 

12* The nucleic acid of claim 11 wherein 
10 said toxin moleucule is chosen from AZT, ricin, or 
cholera toxin. 

13. A vector or an expression vector 
comprising the nucleic acid of claims 1, 2, 8, 9, 
10, 11, or 12. 

15 14. A peptide encoded by the nucleic acid of 

claim 1, 2, 8, 9, 10, 11, or 12. 

15. A cell comprising the nucleic acid of 
claim 1, 2, 8, 9, 10, 11, or 12. 

16. A cell comprising the peptide of claim 

20 14. 

17. The cell of claim 15 wherein said cell 
is bacterium, fungus, or eucaryotic cell. 

18. The cell of claim 16 wherein said virus 
is vaccinia, said bacterium is Escherichia coli, 

25 said fungus is yeast, and said eucaryotic cell is 
a cultered cell line. 

19. A therapeutic agent comprising the 
peptide of claim 14. 

20. A method for treating animals infected 
30 with a bacterium, a fungus, or a virus, 

said method comprising 

administering to said animal a peptide able 
to bind to mannose. 
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21. The method of claim 20 wherein said 
peptide comprises a fragment of human mannose 
binding protein able to bind a carbohydrate. 

22. The method of claim 21 wherein said 
5 peptide inhibits growth or infection of said 

bacterium, fungus, or virus. 

23 . A method for treating animals infected 
with a bacterium, a fungus, or a virus, said 
method comprising administering to said animal a 

10 peptide of claim 14. 

24. The method of claim 20 wherein said 
animal is human. 

25. The method of claim 24 wherein said 
peptide is administered by application of a powder 

15 comprising said peptide to said foot. 

26. The method of claim 23 wherein said 
virus is HIV and said peptide lowers the rate of 
infection of eucaryotic cells by said virus . 

27. The method of claim 24 wherein said 
20 administration is local, intravenous,* 

intramuscular or oral. 

28. Antibodies binding to the peptide of 
claim 14. 

29. A method for diagnosing susceptibility 
25 to infection by viruses, bacteria, or fungi, said 

method comprising detecting the serum level of 
mannose-binding proteins. 

30. The method of claim 29 wherein said 
detecting comprises detecting reaction of an 

3 0 antibody of claim 15 with said serum. 

31. The method of claim 30, wherein said 
detecting comprises of ELISA test. 
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32. The method of claim 26 wherein said 
peptide is mannose-binding protein provided at 
l-500(jg/ml final serum concentration in said 
human. 
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FIG, 2 

get egg taa ata tgt etc ate aac tga gat taa cct tec ctg agt 
ttt etc aca cca agg gag acc ATG TCC TGT TTC ATC ACT CCC TCT 

Met Ser Cys Phe lie Thr Pro Ser 

1 — TJ5 

CTT CTC CTG AGT ATG GTG GCA GCG TCT TAC TCA |GAA| ACT GTG ACC 
Leu Leu Leu Ser Met Val Ala Ala Ser Tyr Ser 



GAA 
Glu 



Thr Val Thr 
180 

TGT GAG GGT GCC CAA AAG ACC TGC CCT GCA GTG ATT GCC TGT AGC 
Cys Glu Gly Ala Gin Lys Thr Cys Pro Ala Val lie Ala Cys Ser 
y 225 
TCT CCA GGC ATC AAC GGC TTC CCA GGC AAA GAT GGG CGT GAT GGC 
Ser Pro Gly He Asn Gly Phe Pro Gly Lys Asp Gly Arg Asp Gly 

270 

ACC AAG GGT AGA AAA GGG GGA ACA GGT CAA GGG CTC AGA GGC TTA 
Thr Lvs Gly Arg Lys Gly Gly Thr Gly Gin Gly Leu Arg Gly Leu 

315 

CAG GGC CCC CCT GGA AAG TTG GGG CCT CCA GGA AAT CCA GGG CCT 
Gin Civ Pro Pro Gly Lys Leu Gly Pro Pro Gly Asn Pro Gly Pro 

y360 

TCT GGG TCA CCA GGA CCA AAG GGC CAA AAA GGA GAC CCT GGA AAA 
Ser Glv Ser Pro Gly Pro Lys Gly Gin Lys Gly Asp Pro Gly Lys 

405 

AG 7 CCG GAT GGT GAT AGT AGC CCA GGC TGC CTC AGA AAG AAA AGC 
Ser Pro Asp Gly Asp Ser Ser Pro Gly Cys Leu Arg Lys Lys Ser 

450 

TCT CCA AAC AGA AAT GGC ACG TAT CAA AAG TGC CTG ACC TTC TCT 
Ser Ala Asn Arg Asn Gly Thr Tyr Gin Lys Cys Leu Thr Phe Ser 

495 

CTG GGC AAA CAA GTT GGG AAC AAG TTC TTC CTG ACC AAT GGT GAA 
Leu Glv Lys Gin Val Gly Asn Lys Phe Phe Leu Thr Asn Gly Glu 

540 

ATA ATG ACC TTT GAA AAA GTG AAG GCC TTG TGT GTC AAG TTC CAG 
He Met Thr Phe Glu Lys Val Lys Ala Leu Cys Val Lys Phe Gin 

585 

CCT CTG TGG CCA CCC CCA GGA ATG GCT GCA GAG AAT GGA GCC ATT 
Pro Leu Trp Pro Pro Pro Gly Met Ala Ala Glu Asn Gly Ala He 

630 

CAG AAT CTC ATC AAG GAG GAA GCC TTC CTG GGC ATG CCT GAT GAG 
Gin Asn Leu He Lys Glu Glu Ala Phe Leu Gly Met Pro Asp Glu 

675 

AAG ACA GAA GGG CAG TTT GTG GAT CTG ACA GGA AAT AGA CTG ACC 
Lys Thr Glu Gly Gin Phe Val Asp Leu Thr Gly Asn Arg Leu Thr 

TAC ACA AAC TGG AAC GAG GGT GAA CCC AAC AAT GCT GGT TCT GAT 
Tyr Thr Asn Trp Asn Glu Gly Glu Pro Asn Asn Ala Gly Ser Asp 

765 

GAA CAT TGT GTA TTG CTA CTG AAA AAT GGC CAG TGG AAT GAC TCC 
Glu His Cys Val Leu Leu Leu Lys Asn Gly Gin Trp Asn Asp Ser 

810 

CCT TGC TTC CAC CTC CCA TCT GCC GTC TGT GAG TTC CCT ATC |tga| 
Pro Cys Phe His Leu Pro Ser Ala Val Cys Glu Phe Pro He 

855 

agg g tc tgt g a g ttc cct ate tg a agg gtc ata tea etc agg ccc 
tec ttg tct ttt tac tct acc aca gec cac gta tgc ttg aaa gat 

aat gat aat ata tea ttc tea gat cag tac ctg cag atg aaa ata 
tea gat cag tac ctg cag atg aag ata aga egg cat tta ttt ttc 

cat tta caa caa aca cct gtg tgt tga gec tta ctt tct gtt tgg 

gta gag ggc tec cct aat gac atg acc aca gtt taa tac cac age 

ttt ttt acc aag ttt cag gta tta aga aaa tc t att ttg taa ctt 

tct eta tga act ctg ttt tct ttc taa tga ga |t att aaaj cca tgt 

aaa aaa aaa aaa aaa aaa 993 
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FI6. 3 

10 20 30 40 SO 60 

CGCCAGAAAGTAGAGAGGTATTTAGCACTCTGCCAGGGCCAACGTAGTAAGAAATTTCCA 
Ar gGl nLysVal Gl uArgTyrLeuAl ALeuCysGl nGl yGl n ArgSerLysLysPhePro 
Al aArgLysEndArgGl y 1 1 •EndHi sSer Al aAr gAl aAsnVal Val Ar gAsnPheGJ 
Pr oGl uSer Ar gGl uVal PheSerThr LeuProGl yPr oThrEndEndGl u 1 1 eSer A 



70 80 90 100 HO 120 

GAGAAAATGCTTACCCAGGCAAGCCTGTGTAAAACACCAAGGGGAAGCAAACTCCAGTTA 
Gl uLy*««tLeuThrGl nAl aSerL»uCysLy*ThrProAr gGl ySarl.ysl.cuGl nLeu 
nAr gLy»CysLeuPr oAr gGlnAl aCysVal LysHi *G1 nGl yGl uAl AAsnSvrStrEn 
rgGl uA»nAl aTyr Pr oGl yLy«Pr oVal EndA»nThrUy*Gl yLy»Gl nThrProVal A 



130 140 150 1AO 170 180 

ATTCTGGGCTGGGTTGGTGACTAAGGTTGAGGTTGATCTGAGGTTGAGACCTTCCTCTTT 
1 1 eLeuGl yTrpVal Gl yAspEndGl yEndGl yEndSerGl uVal Gl uThr PheLeuPhe 
dPheTrpAlaGlyL»uValThrLy»ValGluVal A»pLeuArgLeuArgPr^SerSerLe 
snSer Gl yLeuGl y Tr pEndLeuAr gLeu Ar gLeu 1 1 eEndGl y End AspLeuPr oLeuT 



190 20O 210 220 230 240 

GGATCACCAGCTTTCAGCTCAGGGCCTGCCAATGAGTTTAAATGATAGTTAACAGGTCCT 
Gl ySer Pr oAl aPheSer SerGi yProAl aAsnGl uPheLysEndEndLeuThr Gl yPr o 
uAspHi sGl nLeuSer Al aGlnGl yLeuProfletSerLeuAsnAspSerEndGl nValLe 
rpll eThr Ser PheGl nLeuAr g Al aCysGl nEnd Val Endflet 1 1 eVal A*n ArgSer T 



250 2oO 270 280 290 300 

GGAGGGGAATCAGCTGCCCAGATCAAAGATGGGATTCAGGTGGCAGATGGACCCGAAGAG 
Gl yGl yGl uSerAl aAl aGl nl 1 eLysAspGl y II eGl nVal Al aAspGl yPr oGl uGl u 
uGl uGl yA*nGl nLeuProArgSerLysfletGl yPheArgTrpGlnMetAspProLysAr 
rpAr gGl y 1 1 eSerCysProAspGl nArgTrpAspSer Gl yGl yAr gTrpThr ArgArgG 



3 1 0 320 330 340 350 360 

GACATGGAGAGAAAGAGGAAGAGGAAGCTCCTACAGACACCTGGGTTTCCACTCATTCTC 
AspMetGl uAr gLysAr gLysArgLysLeuLeuGl nThrPr oGl yPhePr oLeuI 1 tLeu 
gThrTrpArgGl uArgGl yArgGlySerSerTyr ArgHi sLeuGlyPheHi sSerPheSe 
1 yHi sGl yGl uLysGl uGl uGl uGl uAl aPr oThr AspThr TrpVal Ser ThrHi sSerH 



370 380 390 4O0 410 420 

ATTCCCTAAGCTAACAGGCATAAGCCAGCTGGCAATGCACGGTCCCATTTGTTCTCACTG 
1 1 ePr oEndAl aAsnAr gHi sLysProAl aGl yAanAl aAr gSerHi sLeuPheSerLeu 
rPhePr oLy sLeuThrGl y 1 1 oSer Gl nLeuAl aMetHi sGl yProI 1 eCysSer Hi sCy 
i sSerLeuSerEndGl nAl aEndAl aSer TrpGl nCy»ThrVal Pr oPheVal LeuThr A 



430 440 450 460 470 480 

CCACGGAAAGCATGTTTATAGTCTTCCATGTTTATAGTCTTCCAGCAGCAACGCCAGGTG 
Pr oAr gLymAl aCysLeuEndSer Ser MetPhe 1 1 eVal PheGl nGl nGl nAr gGl nVal 
sHi sGlyLyeHi sVal TyrSerLeuProCysLeuEndSerSerSerSer AsnAl aAr gCy 
1 aThr Gl uSerMetPh e 1 1 eVal PheHi *Val Tyr Ser LeuPr oAl aAl aThrProGl y V 



490 SOO 510 520 530 540 

TCTAGGCACAGATGAACCCCTCCTTAGGATCCCCACTGCTCATCATAGTGCCTACCTTTG 
Ser Ar gHi »Ar gEndThr Pr oProEndA»pProHi «Cy«SerSerEndCy«i-euProLeu 
sLeuGl yThr AspGl uPr oi_euLeuAr g 1 1 eProThr Al .Hi sHi »Ser Al mTyrLeuCy 

alEndAlaGlnMetAsnProSerLeuGlySerProLenLeuIlelleValProThrPhe 
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530 360 570 380 390 600 

TTAAAGTACTASTCACGCAGGTTCACAAGGAATGTTTACTTTTCCAAATCCCCAGCTAGA 
LeuLysTyrEndSer ArgArgPhwThr Ar gAsnVal TyrPheSerLysSerProAl aArg 
sEndSer ThrSerHi sAl aGl ySer Gl nGX yMetPheThrPhePr oAsnProGl nUsuGl 
al LysVal LeuVal ThrGl nVal Hi *Ly sGX uCysLeuLeuPheGl nil eProSer End A 



610 620 630 640 630 660 

GGCCAGGGTGGGTCATCTATTTCTATATAGCCTGCACCCAGATTGTAGGACAGAGGGCAT 
Gl yGl nGl yGl ySerSer 1 1 eSer 1 1 «EndPr oAl aProArgLeuEndAsp Ar gGl yHi s 
uAl aArgVaX Gl yHi sLeuPheLeuTyrSerLeuHi »ProA«pCy»ArgThrGl uGl yM« 
r gProGl yTrpVaX 1 1 eTyrPh«Tyr 1 1 «A1 aCy«ThrGln I 1 «Val Gl yGl nArgAl aC 



670 680 690 700 710 720 

GCTCGGTAAATATGTGTTCATTAACTGAGATTAACCTTCCCTGAGTTTTCTCACACCAAG 
Al aAr gEnd 1 1 eCysVal Hi sEndLeuArgLeuThrPh«ProGluPhtS«rHi »Thr Uy» 
tL»uGXyLy3TyrVaIPhoIl»A»nEndA9pEndProS«rLeuS«rPh«L*uThrProAr 
ysSerVal AsnMetCy*SerL»uThrGl u 1 1 ©A*nL»uProEndVal Ph*S«rHi sGl nG 



730 740 730 760 770 780 

GTGAGACCATGTCCTGTTTCATCACTCCCTCTCTTCTCCTGAGTATGGTG6CAGCGTCTT 
Val ArgProCysProVal SerSer LeaPr oLeuPheSerEnd Val TrpTrpBl nAr gLeu 
gEndAspHi sVal LeuPheHi sHi sSerLeuSerSerProGl uTyr Gl yGl yS«rVal Le 
1 yGl uThrfletSerCysPhelleThrProSerLeuLeuLeuSerMttVal Al aAl aSerT 



790 800 810 820 830 840 

ACTCAGAAACTGTGACCTGTGAGGGTGCCCAAAAGACCTGCCCTGCAGTGATTGCCTGTA 
ThrGl nLysLeuEndPr oVal ArgVal Pr oLysAr gProAlaLeuGl nEndLeuPr oVal 
uLeuArgAsnCysAspLeuEndGlyCysProLysAspLeuProCysSerAspCysLeuEn 
yrSerGl uThr Val ThrCysGl uGl yAl aGlnLysThrCysProAl aVal 1 1 eAl aCy*S 



830 860 870 880 890 900 

GCTCTCCAGGCATCAACGGCTTCCCAGGCAAAGATGG6CGTGATGGCACCAAGGGTAGAA 
Al aLeuGl nAl aSer ThrAl aSerGl n AX aLysNetGl y Val Met Al aProArg Val Gl u 
dLeuSer ArgHi sGl nArgUeuProArgGl nArgTrpAl aEndTrpHi sGl nGl yEndLy 
erSer ProGl y II eAsnGl yPheProGl yUysAspGl yArgAspGl yThrUysGl yAr gL 



910 920 930 940 9SO 960 

AAGGGGGAACAGGTACGT6TTGGGCTGTTCTGTCTCTGCAATTCTTTACCTTCCAGAGGA 
LysGl yGl uGl n Val Ar g Val Gl yLeuPheCy sLeuCysAsnSer UeuPr oSer Ar gGl y 
sAr gGl yAsnAr gTyrVal UeuGl yCysSerVal Ser Al all eLeuTyrLeuProGl uGl 
ysGlyGlyThrGlyThrCysTrpAlaValLeuSerLeuGlnPhePheThrPh«GlnArgL 



970 9SO 990 1000 1010 1020 

AACTGCCTGGGGATATGAGGAGACGGATGTCCTATTTGAGTATATTTTTCTCAACTATAC 
AsnCysLeuGl yll eEndGl yAspGl yCysProIl eEndVal TyrPh»SerGlnL»uTyr 
uThr Al aTrpGl yTyr Gl uGl uThr AspVal LeuPheGl uTyr 1 1 •PheLeuAsnTyr Th 
yaLeuPr oGi yAspMet Arg ArgArgMetSer TyrL©uSer 1 1 ePh«PheSer Thr I leL 



1030 1040 
TGTAACTCAAAACAGAGATTCAGCTC 
Cy*A«nSerLysGlnArgPh«S r 
rVal ThrGl nAsnArgAspStr Ala 
euEndLeuLvsThr Gl u 1 1 eGlnLeu 
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10 20 30 40 SO 60 

GAATTCCACACAGCAGTTTGTGACTAATAGTTGTCTTGCCAGCCCAGGAAAGTGGCCCAC 
Gl uPheHi sThr Al aVal CysAspEndEndLeuSerCysGl nPr oAr gLy sVal Al aHi s 
AsnSer Thr Gl nGl nPheVal Thr AanSerCysLeuAl aSerPr oGl yLysTrpProTh 
1 1 ©ProHi sSerSerLeuEndLeuI 1 eVai Vai LeuProAl aGl nGl uSerGl yProG 

70 BO 90 100 1X0 120 

AGGTCAGGCCATCCCGTGGGACACAGGATGAATTTTTCTTCTCTGGGTCATTGTCATGTC 
ArgSer Gl yHi sProVal Gl yHi «ArgMet AanPheSerSerLeuGl yHi sCy*Hi sVal 
rGl yGl nAl a 1 1 eProTr pAapThrGl y£nd 1 1 tPh«LffuL»uTrpV«l 1 1 eVal MetSe 
1 nVal Ar gProSer ArgGl yThrGl nAspGl uPhePhePheSerGl yStr LeuSerCysG 



130 140 ISO 160 170 1B0 

AGACCCCTATTCACTTCAGTAGGGATGGCACCAGCGGCAACACCAAAGAAGAGATGGAGT 
Ar gPr oLeuPheThr SsrS/al Gl yflet Al aProAl aAl aThrPr oLysLysAr gTrpSer 
r AspPr oTyrSerLeuGl nEndGl yTrpHi sGl nArgGl nHi sGl nAr gAr gAspGl yVa 
InThrProIl »Hi aPheSer ArgAspGl yThrSerGl yA»nThrl.y»Gl uGl ufletGl uS 



190 200 210 220 230 240 

CAGCAAACAAACATAGGTTTTACTGGGGGAATCTGTTTACAGGGAGATCCAGCAGCAGTG 
Gl nGl nThr Asn 1 1 eGl yPheThrGl yGl y 1 1 ©CysLeuGl nGl yAspPr oAl aAl aVal 
1 SerLysGl nThrEndVal LeuLcuGl yGl uSerVal Tyr ArgGl ul 1 eGl nGl nGl nTr 
er Al aAsnLysHi sArgPheTyrTrpGl yAsnLeuPheThrGl yAr gSerSerSerSerG 



250 260 270 2B0 290 300 

GGCTG6ACAGGAGAACAACAACTACTGGTAAAAACAAATGCAGTTAATTTTCACTTTGCA 
Gl yTrpThrGl yGl uGl nGl nLeuLeuVal LysThr AsnAl aVal AsnPheHi sPheAl a 
pAlaGl yGlnGluAsnAsnAsnTyrTrpEndLysGlnttetGlnLeuIlePheThrLeuHi 
1 yLeuAspArgArgThrThrThrThrGl yLysAsnLysCysSerEndPheSerLeuCysT 



31 0 320 330 340 350 360 

CCCTCCCTGCAGCAACCTCCACGTGGCAATTTATTTCTTAAGTTATTGCTCTCAGGTGCA 
Pr oSer LeuGl nGl nProProAr gGl yAsnLsuPhsLeuLysLeuLeuLeuSerGl yAl a 
sProProCysSerAsnLeuHi sVal Al al 1 eTyrPheLeuSer TyrCysSerGl nVal Hi 
Mr LeuProAl aAl aThrSerThrTrpGl nPhel 1 eSerEndVal 1 1 eAl aLsuAr gCysT 



370 380 390 400 410 420 

CACCATACAGTTATTGA6AGCAGTGCTCAGAAAGGTCAGTCCTGGGTCAAGGTCTCCCTT 
Hi sHi sThrVal 1 1 eGl uSerSer Al aGl nLysGl yGl nSer TrpVal LysVal SerLeu 
sThr IleGlnLeuLeuArgAl aValLeuArgLysVal SerProGl ySer ArgSerProPh 
hr Pr oTyr SerTyrEnd61 uGlnCysSer 61 uArgSerVal UeuGl yGl nGl yLeuProS 



430 440 430 460 470 480 

CTCCTGAGAAGGGATTGGGCATCAAACTCTTGAAGAAGAGAGCAAGAACTAGATATTAAG 
LeuLeuAr g Ar g AspTrp Al aSer AanSerEnd Arg ArgGl uGl nGl uLeuAsp 1 1 eLys 
eStr EndGl uGl y 1 1 eGl yHi sGl nThr LeuGl uGl uGl uSerLy sAsnEnd 1 1 eLeuSe 
erProGluLysGlyLeuGl ylleLysi-eui-euLysLysArgAl aArgThr ArgTyrEndV 



490 500 51 0 S20 S30 540 

TCACATTTCCTTTGTCTTCCAACAGGCCAAGGGCTCAGAGGCTTACAGGGCCCCCCTGSA 
Ser Hi mPheL uCysLeuPr oThr Gl ySl nSl yLeuAr g61 yLetiGl nS I yPr oPr oGl y 
rHi »ll s r PheVal Ph*Gl nGl nAl aLysGl ySerGl uAi a Tyr Ar-gAl aPr oLeuGl 

alThrPheProLeuSerSerAsnArgProArgAiaGlnArgLeuThrGlyProProTrp 1 
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550 560 570 5S0 590 600 

AAGTTGGGGCCTCCAGGAAATCCAGGGCCTTCTGGGTCACCAGGACCAAAGGGCCAAAAA 
LysLeuGl yProProGl yAsnProGl yProSerGl ySerProGl yProLysGl yGl nLys 
uSer Tr pGl yLeuGl nGl u 1 1 eGl nGl yLeuLeuGl yHi sGl nAspGl n Ar g Al aLysLy 
ysVal Gl yAl aSerArgLysSer ArgAl aPheTrpVal Thr Ar gThrLysGl yProLysA 

610 620 630 640 650 660 

GGAGACCCTGGAAAAAGTCCGGGTAAGGACCCCAGCAAGGTCTGAGCTGACTTCACCCAG 
Gi yAspProGl yLysSerProGl yLysAspProSerLysVal EndAl aAspPheThrGl n 
sGluThrLeuGl uLysVal ArgVal ArgThrProAl aArgSerGluLeuThrSerProAr 
rg ArgProTrpLysLysSerGl yEndGl yPr oGl nGl nGl yLeuSerEndLeuHi sProG 

670 680 690 700 710 720 

GGTCCTGAGACCTTGAGTATCTGGTAAGAGGTGCCCCTTCTCCTGTTCCTTCAAAGGAAG 
Gl yProGl uThrLeuSer 1 1 eTrpEndGl uVal ProLeuLeuLeuPheLeuGl nArgLys 
gValLeuArgProEndVal SerGl yLysArgCysProPheSerCysSerPheLysGl yAr 
1 ySerEnd AspLeuGl uTyrLeuVal Ar gGl yAl aPr oSerPr oVal ProSer LysGl uA 



730 740 750 760 

ATACCCAAATTTGCTTTCTGACCCAGTGCCCTCAGCCCTCTC 
IleProLysPheAl aPheEndProSer Al aLeuSerProLeu 
gTyrProAsnLeuLeuSer AspProVal ProSer Al aLeu 
spThrGl nil eCysPheLeuThr Gl nCysProGl nProSer 
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